Comparing the Quality of Focused Crawlers and of the Translation Resources Obtained from them

نویسندگان

  • Bruno Laranjeira
  • Viviane Pereira Moreira
  • Aline Villavicencio
  • Carlos Ramisch
  • Maria José Bocorny Finatto
چکیده

Comparable corpora have been used as an alternative for parallel corpora as resources for computational tasks that involve domainspecific natural language processing. One way to gather documents related to a specific topic of interest is to traverse a portion of the web graph in a targeted way, using focused crawling algorithms. In this paper, we compare several focused crawling algorithms using them to collect comparable corpora on a specific domain. Then, we compare the evaluation of the focused crawling algorithms to the performance of linguistic processes executed after training with the corresponding generated corpora. Also, we propose a novel approach for focused crawling, exploiting the expressive power of multiword expressions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Comparative Effects of Self-assessment and Peer Feedback on Improving Translation Quality

This study investigated the effect of self-assessment and peer-assessment on the quality of students’ transla- tion. Participants of the study were 60 male and female students. They were selected from the senior stu- dents studying English Translation and divided into two groups: self-assessment and peer-assessment. The study adopted a pretest-posttest design, and students’ translation quality ...

متن کامل

The Teaching Methods in Translation Courses: Quality, Relevance and Resources

The study was intended to provide a description of the attitudes of English-major studentstowards the teaching methods in translation courses to find out more about the relevance andquality of methods to the students’ needs, concerning the necessary educational resourcesprovided in the methods of teaching. Accordingly, a multi-item Likert-scale questionnairecontaining 32 items was developed bas...

متن کامل

The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language

Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages is still under question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...

متن کامل

The Effectiveness of Emotionally-Focused Couple Therapy on Happiness and Quality of Married Life of Both Working Couples

The present study aimed to Determine the effectiveness of emotionally-focused couple therapy on happiness and quality of married life of dual-career couples. The method of study was quasi-experimental with a pre-test-post-test design with a control group. The statistical population included all working couples referred to Alborz Counseling Center in Karaj from the second half of October to the ...

متن کامل

Improving Learner Performance in Producing Grammatical Structures

This experimental study examined the effectiveness of using focused and unfocused tasks on Iranian intermediate EFL learners’ performance in producing noun, adjective, and adverb clauses. In addition,the aim of this study was to explore the effects of form-focused instruction and the feedback students received from their teacher after doing focused grammar tasks. Data consisted of the scores of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014